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Abstract 



Neutral B meson oscillations in the B° — B° and B° — systems were studied 
Ph. using a sample of about 4.0 million hadronic Z decays recorded by the DELPHI 

detector between 1992 and 2000. Events with a high transverse momentum lep- 
ton were removed and a sample of 770 k events with an inclusively reconstructed 

vertex was selected. 

The mass difference between the two physical states in the B° — B° system was 



measured to be: 

Am d = (0.531 ± 0.025(stat.) ± 0. 007 (syst.))^ 1 . 

The following limit on the width difference of these states was also obtained: 

|Ar Bd |/r Bd < 0.18 at 95% CL. 

As no evidence for B° — B° oscillations was found, a limit on the mass difference 
of the two physical states was given: 

Am, > 5.0 ps" 1 at 95 % CL. 

The corresponding sensitivity of this analysis is equal to 6.6 ps _1 . 
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1 Introduction 

In the Standard Model, B° q — B° q (q = d, s) mixing is a direct consequence of second 
order weak interactions. Starting with a B q meson produced at time t—0, the probability 
density V to observe a B q decaying at the proper time t can be written, neglecting effects 
from CP violation: 

V(B° q ^B° q ) = ^e" r ^[cosh(^t) + cos(ArV)]. 

Here T q = -^—^ — -, Ar, = T q — T q , and Am, = m q — m q , where H and L 
denote respectively the heavy and light physical states. The oscillation period gives a 
direct measurement of the mass difference between the two physical states. The Standard 
Model predicts that Ar <C Am [1] . Neglecting a possible difference between the lifetimes 
of the heavy and light mass eigenstates, the above expression simplifies to: 

p _. = V{B o^ B O q) = i- e -*[l + cos(ArV)] (f) 

and similarly: 

V$°- =V(B° q ^B°) = J- e -i[l-cos(ArV)], (2) 
where r q is the lifetime of the B q . 

In the Standard Model, the Bj - B° q (q = d, s) mass difference Am, (having kept only 
the dominant top quark contribution) can be expressed as follows [I]: 




In this expression Gp is the Fermi coupling constant; F(x t ), with x t = results from 

m w 

the evaluation of the box diagram and has a smooth dependence on x t . t]b is a QCD 
correction factor obtained at next-to-leading order in perturbative QCD. The dominant 
uncertainties in Eq.(3) come from the evaluation of the B meson decay constant /b and 
of the "bag" parameter B Bq [2]. In terms of the Wolfenstein parametrization [3], the two 
elements of the Vckm matrix are equal to: 

\v td \ =A\ 3 y/(i-p) 2 + v 2 ; \V ts \ = A\ 2 , (4) 

neglecting terms of order 0(A 4 ). At this order | 14s I is independent of p and rj and is 
equal to | V C 6 |. Eq. (3) relates Am^ to |14d|- It defines a circle in the p — rj plane. 
Nevertheless the precision on Am d cannot be fully exploited due to the large uncertainty 
which originates in the evaluation of the non-perturbative QCD parameters. 

The ratio between the Standard Model expectations for Am d and Am s is given by: 

Am d m B J^ d B Bd r] Bd \V td 
Am s m B Jl s B Bs r] Bs \V ts 

A measurement or a limit on the ratio gives a circular constraint in the p — i] plane. 
This ratio depends only on the ratio of the non-perturbative QCD parameters which is 
expected to be better determined than their absolute values which occur in Eq. (3). 
Using constraints on p and rj from existing measurements (except those on Am s ), the 



(5) 
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distribution for the expected values of Am s can be obtained. It has been shown that 
Am, should lie, at 95% C.L., between 9.7 and 23.2 ps" 1 [2]. 

Using the DELPHI data, several analyses searching for B° s — B° s oscillations have been 
performed on selected event samples of exclusively reconstructed B° mesons, D s -lepton 
pairs, D s -hadron pairs and events with a high transverse momentum lepton [4]. In this 
analysis events with a high transverse momentum lepton have been removed and the 
remaining events are used to search for B° oscillations and to measure the B° oscilla- 
tion frequency. Two analyses will be described: one inclusive vertex analysis based on 
a probabilistic approach using the data set from 1992 to 2000 and one based on neural 
networks optimized for high values of Am s using only the 1994 data. To avoid overlap 
with other analyses [4], events with a high transverse momentum lepton are removed 
from the sample. Both analyses reconstruct an inclusive secondary vertex which is used 
to estimate the proper time. Events that mix are selected using a tag based on several 
separating variables which are combined using probabilities or neural networks respec- 
tively. The neural network analysis will provide a check and a confirmation of the results 
and in particular of the sensitivity at high values of Am s . 

The inclusive vertex analysis is presented in section 2, describing the secondary vertex 
and proper time reconstruction, the production and decay tags and the fitting programme. 
The measurement of the B° — B° oscillation frequency is described in section 2.7 and the 

results of the search for B° — B° oscillations are presented in section 2.8. In section 3, the 
neural network analysis is described, while the conclusions are presented in section 4. 

The results presented in this paper will be combined later with other DELPHI and 
LEP results. 

2 Inclusive vertex analysis 

For a description of the DELPHI detector and of its performance the reader is referred 
to [7]. The analysis described in this paper used the precise tracking based on the silicon 
microvertex detector to reconstruct the primary and secondary vertex. To estimate the 
B momentum and direction, the neutral particles detected in the electromagnetic and 
hadronic calorimeter and the reconstructed tracks were used. Muon identification was 
based on the hits in the muon chambers being associated with a track. Electrons were 
identified using tracks associated with a shower in the electromagnetic calorimeter. The 
dE/dx energy loss measurement in the Time Projection Chamber and the Cherenkov 
light detected in the RICH were used to separate pions (and also electrons or muons) 
from kaons and protons. 

Tracks were selected if they satisfied the following criteria: their particle momentum 
was above 200 MeV/c, their tracklength was at least 30 cm, their relative momentum 
error was less than 130%, their polar angle was between 20° and 160° and their impact 
parameter with respect to the primary vertex was less than 4 cm in the xy plane (perpen- 
dicular to the beam) and 10 cm in z (along the beam direction). Neutral particles had 
to deposit at least 500 MeV in the calorimeters and their polar angle had to lie between 
2° and 178°. 

To select hadronic events it was required that more than 7 tracks of charged parti- 
cles were accepted with a total energy above 15 GeV. The thrust direction was deter- 
mined using charged and neutral particles and its polar angle was required to satisfy 
| cos(9thrust)\ < 0.8. The event was divided into hemispheres by a plane perpendicular to 
the thrust axis. In each hemisphere the total energy from charged and neutral particles 
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had to be larger than 5 GeV. In total about 4 million hadronic Z decays were selected 
from which 3.5 million were taken in the LEP I phase (1992-1995) and 500k were collected 
as calibration data in the LEP II phase (1996-2000). 

Using tracks with vertex detector information, the primary vertex was fitted using the 
average beamspot as a constraint [5]. For each track the impact parameter with respect 
to the primary vertex was calculated and the lifetime sign determined as explained in 
the paper quoted above. The b tagging probability 1 P£ is a measure of the consistency 
of these track impact parameters with the hypothesis that all selected tracks came from 
the event's production vertex. Events without long-lived particles should have a uniform 
distribution of P%, while those containing a b-quark tend to have small values. In the 1992 
and 1993 data the vertex detector measured only the R(p (R being defined as \/ x 2 + y 2 
and <fi the azimuthal angle) coordinate, while from 1994 to 2000 the z coordinate was also 
measured. In the 1992-1993 data, events were selected if the b tagging variable P£ was 
less than 0.1, whereas in the 1994-2000 data, the cut could be placed at 0.015. 

Jets were reconstructed using charged and neutral particles by the LUCLUS [6] jet 
algorithm with an invariant mass cut DJOIN of 6 GeV/c 2 . Leptons were identified 
and their transverse momentum with respect to the jet axis was determined. Loosely 
identified muons with momenta above 3 GeV/c were accepted as well as standard and 
tightly identified muon with momenta above 2 GeV/c. The reader is referred to [7] 
for the identification criteria. Events with a standard or tightly identified muons with 
momentum above 3 GeV/c and a transverse momentum above 1.2 GeV/c were removed 
from the selected event sample. This was done to avoid overlap with other analyses that 
use leptons [4] with a high transverse momentum. For electron identification a neural 
network was used with a cut value that corresponds to 75% efficiency [7]. The electron 
had to have a momentum above 2 GeV/c. Electrons with a momentum below 3 GeV/c 
had to pass a cut value that corresponds to 65% efficiency Again to avoid overlap with 
other analyses that use high transverse momentum leptons, events with an electron with 
momentum above 3 GeV/c and a transverse momentum above 1.2 GeV/c satisfying a 
cut value that corresponds to 65% efficiency were removed. The selected electrons and 
muons will henceforth be referred to as soft leptons. 

Samples of hadronic Z decays (4 million events) and of Z bosons decaying only into bb 
quark pairs (2 million events) were simulated using the Monte Carlo generator programme 
JETSET7.3 [6] with DELPHI tuned JETSET parameters and updated b and c decay 
tables [8]. The detailed response of the DELPHI detector was simulated [9]. 

2.1 Secondary vertex reconstruction 

The secondary vertex reconstruction and proper time determination procedures are 
identical for events with or without a soft lepton. First the probability Pi that a charged or 
a neutral particle comes from the secondary (bottom or charm) vertex was parametrized 2 . 
The following information was used for tracks: the lifetime-signed impact parameters and 
their errors (in R(f) and Rz), the transverse momentum with respect to the jet axis, the 
muon and electron identification and the rapidity 3 with respect to the jet axis. For 
neutral particles the transverse momentum and rapidity were used. For each of these 
quantities the probability was parametrized using the simulation. The total probability 
was obtained by combining these individual probabilities assuming they are independent. 

1 E refers to the fact that the total event was used and the + sign means that the lifetime sign had to be positive. 
2 Thus a Pi value of 0.8 means that 80 percent of the selected particles will come from the secondary vertex. 
3 For calculating rapidities, charged and neutral particles were assigned the pion mass. 
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To start the first level secondary vertex fit, tracks were selected with at least one 
associated hit in the vertex detector and a probability Pj larger than 60%. The decay 
length - i.e. the 3-D decay distance - per track was determined by calculating the point 
of closest approach of the track to the B particle trajectory which was approximated by a 
track coming from the primary vertex and having the direction of the reconstructed jet. 
The first level secondary vertex was fitted using the measured decay lengths per track 
and their errors, the azimuthal and polar angles of the tracks and the B trajectory. The 
result of this approximate fit was a decay length, its error and a x 2 of the fit. Further, the 
Xt contribution of each track to the total x 2 was determined. To remove tracks coming 
from the primary vertex the following iterative procedure was performed: if the secondary 
vertex was reconstructed with more than two tracks, the track upstream of the vertex 
(i.e. closer to the fitted primary vertex) with the largest Xt was removed if its Xt was 
larger than 4. Secondly, tracks were removed that did not combine with any of the other 
tracks. To achieve this, all two track combinations were made and the number of good 
matches was counted. A good match was defined as a two track vertex that was within 
2 standard deviations of the fitted secondary vertex. For each track, the fraction f goo d 
of good matches to the total number of combinations was determined. The track with 
the smallest f goo d value was removed if its value was below 20%, and then the first level 
vertex fit was redone. The procedure ends when no track could be removed by the listed 
criteria. 

At the end of this procedure a full vertex fit was performed using the measured track 
parameters and the corresponding covariance matrices. To the list of tracks selected 
for the fit, the B-track with its covariance matrix was added as a constraint. For each 
track the impact parameter and its error with respect to the fitted secondary vertex were 
calculated. The global x 2 of the fit was defined as the sum of the squares of the track 
impact parameters divided by corresponding uncertainties (in R<p and Rz). As a result 
the B decay length and its error were obtained. 

The presence of tracks from charm particle decays in the vertex fit has two effects. 
Firstly, the fitted vertex does not coincide with the B vertex, but is some average between 
the B and D vertex positions. Secondly, the x 2 °f the vertex increases because of the 
charm decay length. It was therefore important to remove as much as possible the decay 
products of charmed particles from the vertex fit. For this purpose the probability that a 
track came from charm was evaluated on the basis of kinematic and vertex information. 
For example, the momentum distribution of particles from charm, in the B rest frame, 
is softer than that for particles from B decays. Secondly, a particle from charm decay is 
produced downstream of the fitted vertex, while a particle from a B hadron originated 
upstream of this vertex. Two new vertex fits were performed. In the first, one particle 
that most likely originates from charm was removed. In the second fit, the two particles 
most likely to come from charm were removed. 

Using the simulation, an estimate was made of the B decay length and of its error, 
using as an input the fitted decay length, its associated (or raw) error, the x 2 an d the 
number of fitted tracks. The expected error on the B decay length was parametrized in 
the same way. This was done for the three vertex fits (removing 0, 1 and 2 particles as 
described in the previous paragraph). Removing 1 or 2 particles has the advantage of 
reducing the bias caused by the presence of particles from charm. On the other hand the 
resolution is degraded if a track is removed. Due to the fact that the x 2 is sensitive to the 
presence of particles from charm, part of the bias is corrected for in the parametrization 
of the B decay length. Finally, out of the three vertex fit results, the result with the 
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smallest expected error on the B decay length was chosen. In 51% of the cases no track 
was removed, in 36% one track and in 13% two tracks were removed. 

In Figures la and b the raw error as it comes out of the full vertex fit and the re- 
constructed minus the B decay distance divided by the raw error are shown for the 
1994-1995 simulated events. The tail due to the presence of charmed particles can be 
clearly observed. Figures lc and d show the expected error and the reconstructed minus 
the simulated B decay distance after applying the correction procedure described above. 
The latter distribution is clearly more Gaussian and its width is close to unity. 
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Figure 1: Figure a) shows the expected or raw error, Figure b) the reconstructed minus 
the simulated B decay distance divided by the raw error for the 1994-1995 simulation. 
Figures c) and d) show the expected error and the reconstructed minus the simulated B 
decay distance after applying the procedure described in the text. 



2.2 Proper time reconstruction 

To determine the proper time, the momentum of the B hadron had to be measured. An 
estimate of the energy of the b jet was made, applying energy-momentum conservation 
to the whole event. The masses of the jet containing the B hadron and of the system 
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formed by the remaining charged and neutral particles, labelled respectively M 1 and M 2 , 
were measured. The b jet energy was obtained as: 

E jet = E cms /2 - {M 2 2 - Ml) I '(2 E cms ), (6) 

where E cms is the centre of mass energy. This significantly improved the b jet energy 
resolution. The B energy was determined as: 



E B = % I *- E jet , (7) 



where Ei is the energy of the charged or neutral particle and Pi is the probability that a 
particle comes from the decay of a B hadron (see section 2.1). 

The momentum of the B hadron was determined from the B energy and a small 
correction typically of order 10% was applied as a function of the following quantities: 
the weighted (with Pi) number of charged and neutral particles, the ratio of the raw B 
energy (£V PjE*) to the jet energy Ej et , the invariant mass Mi, the ratio of the charged 
over the total raw B energy and the number of jets. The corrections were obtained from 
the simulation. The reconstructed B momentum is shown in Figure 2. 

The expected error was parametrized as a function of the uncorrected B energy and of 
the jet energy. It lies between 3 and 9 GeV/c and is on the average equal to 5 GeV/c. The 
reconstructed minus simulated B momentum divided by the expected error for simulated 
events is shown in Figure 2. 

The proper time t was calculated using: 

t = — (8) 
V 

where m is the B mass, L the decay length and p the estimated B momentum. The 
expected error a t on the proper time was estimated using: 



,, mSL 2 mLSp 2 

°t = ) + (-^~) ' ( 9 ) 

where SL is the expected error on the decay length and bp is the error on the momentum. 
The data were divided into eight categories according to the value of the proper time 
resolution. This division was made because most of the sensitivity at high values of Am s 
came from events with the best proper time resolution. The cuts are given in Table 1. 
To fall into the first category, the expected resolution had to be smaller than 0.12+0.07t 
ps (t in ps units). Events with a resolution worse than 0.35+0.2t ps were rejected. 

The first four categories refer to events with a soft lepton and the last four to events 
with only an inclusive vertex. The soft lepton sample consists of 155023 events. The 
latter sample will be referred to as the inclusive vertex sample and consists of 614577 
events. The proper time resolutions for the last four classes are shown in Figure 3. 

The systematic error on the decay length resolution was estimated to be ±10%. This 
number was obtained in the following way. First, a comparison of data and simulation 
for the expected decay length error (see Fig. 4) showed that the data show a discrepancy 
for a scale error of less than ±5%. Secondly, the description of the impact parameters 
of the tracks with negative lifetime sign allow for a scaling of the associated error of less 
than ±5% [5]. Finally, a study was made of the amplitude error (see section 2.8) as a 
function of Am s comparing data and simulation. The amplitude error increases because 
of the finite proper time resolution. The amplitude error for data and simulation are in 
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Figure 2: Upper diagram: Reconstructed B momentum. The dots correspond to the 1992 
to 2000 data, the solid line to the measured momentum distribution obtained from sim- 
ulated events. Lower diagram: The reconstructed minus the true B momentum divided 
by the expected error for simulated events. 



agreement within ±10%. This is mainly due to the fact that the numbers of events in 
each category agree for data and simulation. 

The systematic error on the momentum resolution was estimated to be ±10%. This 
number was obtained in the following way. Comparing the observed momentum in a 
hemisphere with the expected momentum in that hemisphere - obtained using energy 
and momentum conservation - for data and simulation, it was found that the momentum 
resolution agreed to better than ±10%. Finally, the study of the amplitude error, men- 
tioned above, showed that the amplitude error for data and simulation was in agreement 
within ±10%. 
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category 


1 


2 


3 


4 


crtips) 


0.12+0.07 t 


0.18+0.08 t 


0.25+0.1 t 


0.35+0.2 t 


soft lepton sample 


22740 (5533) 


41597 (10598) 


42835 (12091) 


47851 (15620) 


category 


5 


6 


7 


8 


crtips) 


0.12+0.07 t 


0.18+0.08 t 


0.25+0.1 t 


0.35+0.2 t 


inclusive vertex sample 


68875 (16476) 


146075 (36633) 


171859 (47702) 


227768 (73809) 



Table 1: Cuts on the resolution a t and total number of selected events (in parenthesis 
the number of events corresponding to the 92-93 data) for the different categories. 





reconstructed-generated time (ps) 

Figure 3: Reconstructed minus generated proper time for the inclusive B vertex sample 
corresponding to categories 5 to 8. The dots correspond to the simulated data and the 
histograms to the parametrization of the resolution function (see section 2.5). 
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Figure 4: The expected error on the decay length for 1992-2000 data (points with error 
bars) and simulation (solid line). 
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2.3 Production and decay tag 

To distinguish between events in which the neutral B meson has mixed or not, a 
production and a decay tag were defined. They give, respectively, the b-flavor content 
(ie b or b) of the B hadron at production and decay times. In this analysis both the 
production and decay tags were optimized for B° mesons. In Z decays, b and b quarks 
are produced back to back, in pairs. The hemisphere opposite to the decaying B can 
therefore be used to tag the flavour at production time. This will be called the opposite 
side production tag which is obtained from a combination of several variables: 

• the average charge of a sample of tracks, attached to the b-jet, and enriched in 
b-decay products: 

Qjet = E QiPn/^PiL with Pi > 0.5, 

where pn is the component of the momentum of the particle along the jet axis direction, 
and Pi is its probability that it is a B decay product, as defined at the beginning of section 

2.1; 

• the average charge of a sample of tracks, attached to the b-jet, and enriched in 
b-fragmentation products: 

Qf = J2qiPi L (Pi < 0.5)/J2 P i L (el\Pi). 

Note that the denominator sums over all particles, because the fraction of the total 
longitudinal momentum that is coming from fragmentation particles is relevant; 

• the charge and momentum p* of any identified lepton, in the B rest frame, which is 
determined from the inclusively reconstructed B momentum vector; 

• the heavy particle charge for an identified kaon or proton and its momentum p* in 
the B rest frame. 

Using simulation, distributions for these variables were obtained for B and B hadrons. 
These variables were converted into probabilities Phi for the b hypothesis, and then 
combined to give the opposite side production tag. This was done in the following way. 
For each variable a rejection factor Ri is defined as l Ph p b . and a combined rejection factor 
R is obtained by taking the product of the rejections Ri. The combined probability P 
is then equal to In Figure 5 the distribution of the opposite side production tag is 
shown for 1992-2000 data and simulation. The tagging purity is defined as the fraction 
of correct flavour assignments at 100% efficiency, i.e. all events were classified if the cut 
on the combined probability was set at 0.5. A purity equal to 68% has been measured 
on 1992-2000 simulated events. 

A same side production tag is also defined using the fragmentation tracks accompa- 
nying the decaying B meson. Both leading fragmentation pions and kaons are sensitive 
to the b or b production flavour. The following quantity Q sa me was defined: 

Qsame = R{PiL, hi)(l - Pi)q U 

where hi is equal to 1 for a heavy (proton, kaon) or to for a light (electron, muon or pion) 
particle and the sum extends over all tracks. The parametrization of the function R - a 
polynome as a function of pn, - was obtained using simulated events. The variable Q sa me 
was converted into a probability and then combined with the opposite side production 
tag to give the combined production tag P pro d- 

In Figure 5 the distribution of P pro d is shown for 1992-2000 data and simulation. 
The uds and charm quark contributions are small (see Table 2) and are included in the 
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total distribution. The tagging purity for B s mesons is equal to 71% for the 1992-2000 
simulation. As expected, this value is higher than the result, 64%, obtained using the 
opposite side production alone [4]. The difference between data and simulation for the 
combined production tag, which is apparent in Fig. 5 will be taken into account by fitting 
the tagging purity for the data (see section 2.6). 

1992-2000 data 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

production tag opposite side 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

production tag same and opposite side 



Figure 5: Production tag using only information from the opposite side (upper diagram) 
and the production tag using information from both sides (lower diagram). The dots 
correspond to the 1992 to 2000 data, the solid line to the simulation. The hatched areas 
correspond to the b (left) and b (right) contributions. 

The other important variable in the analysis is the decay tag. In the soft lepton sample 
this tag is relatively straightforward using the charge of the lepton. Most of the B — B 
separation comes from the momentum p* of the lepton in the B rest frame that allows 
the separation of a prompt lepton coming from the B vertex from a lepton coming from a 
charm decay. Other information in the event (such as, for example, the impact parameter 
of the lepton with respect to the secondary vertex and the isolation of the lepton (presence 
of other tracks from charm decay vertex)) helps to improve the B — B separation. Finally, 
also the decay tag developed for the inclusive vertex sample (discussed below) was added 
to improve the performance slightly. 
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For events with no lepton, obtaining a decay tag is more difficult. The following 
approach was taken. All the charged and neutral particles were boosted back in the 
B reference frame using the estimated B momentum and direction (see section 2.2). 
The B-thrust axis was determined in the B reference frame using charged and neutral 
particles with Pj greater than 0.5. The particles were assigned to the forward or backward 
hemisphere. Usually one hemisphere contains most of the tracks from the B vertex while 
the other contains most of the tracks from charm decay. This is called a dipole, as the B° 
decays to a DJ^ and a virtual W + and the charge difference between the two hemispheres 
is equal to ± 2. Under the hypothesis that the forward (backward) hemisphere contains 
the particles from the charm decay and the backward (forward) hemisphere the particles 
from the B vertex, the flavour probability of the decaying B° s is evaluated. This is achieved 
by using the charge and the momentum p* in the B rest frame of the heavy (p, K) and 
light (e,/x,7r) particles. For these parametrizations, the simulation was used. Then a 
hemisphere probability is evaluated for the hypothesis that the charmed particle is in the 
forward (backward) hemisphere. This probability depends on the lifetime- signed impact 
parameter of the tracks with respect to the secondary vertex, on their momenta in the B 
rest frame and on the hemisphere multiplicity. By combining the hemisphere probability 
with the flavour probability, the decay tag for the inclusive vertex sample was obtained. 
The tag was optimized for B s mesons. 

In Figure 6 the performance of the decay tag Pdecay for the soft lepton sample is shown 
for 1992-2000 data and simulation. The uds and charm quark contributions are small 
(see Table 2) and not shown explicitly. The tagging purity is 69% at 100% efficiency. 
The events with Pdecay from to 0.02 and 0.98 to 1 are due to prompt B decays with a 
high p* value. The performance of the decay tag for the inclusive vertex sample is also 
shown. The B° s tagging purity is 58% at 100% efficiency. The difference between data 
and simulation for the decay tag will be taken into account by fitting the purity for the 
data, as is discussed in section 2.6. 



2.4 Sample composition 

For the sample composition the following B-hadron production fractions were assumed 
[10]: / B . = 0.097 ± 0.011, / Bbajyons = 0.103 ± 0.017, / Bu = / Bd = 0.40. For the lifetime 
of the different B species it was assumed that [10]: r Bu = 1.65 ps, r Bd = r B o = 1.55 ps 
and t Bk = 1.20 ps. 

- LJ baryons 1 

Using the simulation, the uds and charm backgrounds were extracted. The background 
fractions for the different data sets and vertex categories are listed in Table 2, where f u d s 
is defined as the number of uds events divided by the total number of events in the 
sample. 



background 


data set 


cat 1 


cat 2 


cat 3 


cat 4 


cat 5 


cat 6 


cat 7 


cat 8 


fuds 


1992-1993 


.0074 


.0158 


.0288 


.0495 


.0226 


.0407 


.0717 


.1237 


fuds 


1994-2000 


.0046 


.0076 


.0117 


.0229 


.0138 


.0199 


.0329 


.0588 


f charm 


1992-1993 


.0202 


.0653 


.1116 


.1779 


.0359 


.0920 


.1433 


.1900 


f charm 


1994-2000 


.0356 


.0673 


.1201 


.1919 


.0436 


.0928 


.1514 


.2004 



Table 2: The background fractions for the 1992-2000 data sets divided according to the 
different vertex categories. 
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1992-2000 data 
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Figure 6: Decay tag for the soft lepton sample (upper diagram) and inclusive vertex 
sample (lower diagram). The dots correspond to the 1992 to 2000 data, the solid line to 
the simulation. The hatched areas correspond to the b (right) and b (left) contributions 
at the time of decay. 



2.5 Fitting programme 

The fitting programme provided an analytic description of the data for the like- and 
unlike-sign tagged events. It was used to fit the amplitude of B° — B° oscillations. In 
the fitting program the time resolution function lZ(t rec — t true ,t true ) was parametrized. 
The resolution function gives the probability that, given a certain value for the true 
proper time t true , a proper time value t rec is reconstructed. Two asymmetric Gaussian 
distributions 4 are used to describe the main signal, as well as one asymmetric Gaussian 
to describe poorly measured events and one Gaussian to describe the probability that the 
secondary vertex is reconstructed near to the primary vertex. The widths of the Gaussian 

distributions are of the form a = y / ' + <Jpti rue with a p being the relative momentum 

resolution. The relative normalizations of the Gaussian distributions are left free to vary 
and parametrized as a constant plus a term proportional to 1 — e~ ttrue / T , where r is the 

4 The asymmetric Gaussian has two widths, one for proper time values above the central value, the other for below. 
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average b lifetime. For each vertex category the time resolution function was fitted and 
the result of the fit is shown in Figure 3. The effect of different parametrisations of the 
resolution function was found to be neglible with respect to the effect of a change in the 
proper time resolution (see section 2.2). 

The probability Vb{t r ec) for a B event to be observed at a proper time t rec was written 
as a convolution of an exponential B decay distribution with lifetime r, an acceptance 
function A(t) and the resolution function: 

/•oo e -t/r 

V b (t rec )= A(t)K(t rec -t,t) dt. (10) 

Jo r 
The acceptance function was parametrized for the different vertex categories using 
the simulation. The difference in acceptance for the different B species was found to 
be negligible. Due to the requirements on the flight distance in the track selection, the 
acceptance is a smooth, but not flat, function of the proper time. The probabilities for 
uds (V u ds) and charm (V c ) events for the different vertex categories are parametrized 
as a function of t rec with exponential functions whose slopes are determined using the 
simulation. 

Like- or unlike-sign tagged events are those events for which P com b is respectively larger 
or smaller than 50%. The combined tagging probability P com b is defined as 

Pcomb PprodPdecay ~\~ (1 Pprod){^- Pdecay)- (1-Q 

The tagging purity e Bq is expressed in terms of the combined tagging probability P CO mb- 
For B° events it is given by: 

e Bs = 0.5 + \P comb - 0.5| (12) 

The tagging purities for the other B particles and for the charm and light quark back- 
ground events were also expressed as functions of P com b {P pro d and Pdecay) using the 
simulation (see section 2.6). 

The total probability to observe a like-sign tagged event at the reconstructed proper 
time t rec is: 



V Hke (t rec ) = f b K^V^ C \{trec) + h £ /B q (l-e Bq )nTc m C(W) + 

q=d,s q=u,d,s,baryons 

f c (l — e c )V c {t r ec) + /«ds(l — ^uds)Puds{Uec) (13) 

and correspondingly for an unlike-sign tagged event: 

v uniik e (g = /^/B,(i-^« q (U+/ 6 E fB^v~ x (t rec ) + 

q=d,s q=u,d,s,baryons 

fc^cPcitrec) fuds^udsPudsi^rec)' (14) 

For the mixed B° and B° mesons one has the following expression (q = d,s): 

PCX) 

V™:^{t rec )=\ A(t)K(t rec -t,t)V™(t)dt, (15) 
Jo 

while for the unmixed case the B u and B baryons also have to be included (q = 
u, d, s, baryons): 

POO 

'PSZgitrec) = / A(t)TZ(t rec - t, t)V^ X it)dt, (16) 

where V^ n)mx -(t) are defined in Eqs. (1) and (2). 
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2.6 Modelling the simulation and data 

The present analysis uses the production and decay probabilities on an event-by-event 
basis. The tagging purity cb s is calculated from these quantities as defined in Eq. (12). 
The production and decay probabilities - and thus the tagging purities - for the different 
B species, charm and light quarks are different. These differences have to be parametrized 
in the analytic fitting programme. For this simulated events were used. 

The new parametrization is obtained by modifying the probability P (P pro d or Pdecay)- 
For this purpose a parameter a is introduced and the new probability is defined as: 

P new = R a /(i + R a ), (17) 

where the rejection R is defined as j^p. A parameter value of 1 means that the probability 
remains unchanged. It was found out on simulation that this particular parametrisation 
gives an accurate description of the tag performance for neutral B species. For example 
the a parameter for a B hadron is obtained in the following way. Using the simulation 
the distribution of the probability P is plotted separately for B and B hadrons. The two 
distributions are divided and fitted to the expression (17) leaving free the a parameter. 
This procedure is illustrated in Fig. 7 using the decay tag for the inclusive vertex sample 
for the Bd meson. 

For leptons, the decay purities for the different B species were studied on simulation 
and found to be very similar. The decay tag parameter for the soft lepton sample was 
therefore put equal to 1. The decay tag parameters for the inclusive vertex sample 
and soft lepton sample as well as the production tag parameter are listed in Table 3. 
The values are obtained from the simulation. For the charm quark a parameter ap, of 
4.2 is used if the probability lies between 0.2 and 0.8, otherwise a D —l. For the soft 
lepton sample the relatively high value of ap, of 4.2 is understandable, because a lepton 
coming from the charmed particle, at relatively low p*, will tag correctly the charge of 
the charmed particle. Note that the parameters «d and ap for the different B species are 
quite similar, except for the B u where ap varies as a function of P 5 between 0.75 and 1. 
For this reason, tagging purities for B n and for the other types of b hadrons have been 
controlled directly from the data, as explained in the following. From the new probability 
Pne«> uds ' c , the combined probability Pf ^ ds ' c is calculated using Eq. (11) and the purity 
eB q ,uds,c is obtained using: 

^B q ,uds,c = 0.5 + |P c B D -f ' c - 0.5|. (18) 

5 The functional form used is a = ao + ot\ e 2 l p -"- 5 l . 



Particle 


inclusive vertex sample 


soft lepton sample 


all events 




decay tag «d 


decay tag «o 


production tag ap 


B s 


1 


1 


1 


B d 


1.15 


1 


1.13 


B u 


0.75 to 1 


1 


0.3 to 0.8 


B baryon 


0.80 


1 


1.09 


uds 


0.20 


0.20 


0.80 


charm 


4.2 (P=0.2-0.8) 


4.2 (P=0.2-0.8) 


0.50 



Table 3: The parameters a for the production and decay tag for the different particles 
as obtained from simulation. 
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Figure 7: Distributions for the decay tag for the inclusive vertex sample for the 
(solid line) and (dashed line) mesons. The bottom plot shows the ratio of the two 
distributions with a fit of Eq.(17) giving the a parameter of 1.15. 



It is important also to have a correct modelling of the tagging purity for the data i.e. 
to have a good description of the like- and unlike-sign tagged events. Using the data for 
each category, a correction factor C to the parameter a is fitted: 



a 



data 
D 



C a D , (19) 

where C is determined from the fraction of like-sign tagged events. The C factor was 
determined iteratively and Am^ = 0.531 (see section 2.7) was finally used. For the soft 
lepton sample, the results are shown in Table 4. 

The 1992-1993 and 1994-2000 data sets have different performance for tracking and 
lepton identification and therefore the fitted C values can be different. The total error 
on C for the soft lepton sample is better than ± 5%. 

The parameter a for the decay tag in the inclusive vertex sample for a B u meson is 
different - it also varies as a function of the tagging probability -from those for the other 
B particles (see Table 3). By separating the inclusive vertex sample into one enriched and 
one depleted in B u particles it was possible to determine, from the data, the correction 
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data set 


category 


fitted value for C 


1992-1993 


1 


0.95 ± 0.05 




o 
I 


U.ol ± U.Uo 




3 


0.76 ±0.05 




4 


0.82 ±0.05 


1994-2000 


1 


0.77 ±0.04 




2 


0.69 ±0.04 




3 


0.83 ±0.04 




4 


0.84 ±0.05 



Table 4: The fitted correction factor C for the soft lepton sample. 

factor C for B u mesons and for the other particles. These samples were obtained by 
cutting on the secondary vertex charge. Three fits were performed. First it was assumed 
that the correction factors C for all types of B particles were identical. Then in a second 
fit it was assumed that C for the non-B u particles was equal to 1 and the value of the 
correction factor C for the B u particles was fitted. From the x 2 of the fit it was clear that 
the second fit result was preferred. The value of the correction factor C for B u particles 
was fixed to the average value between the first and second fit results and a third fit was 
performed leaving C free for non-B^ particles. The results for the final fit are shown in 
Table 5, where the errors quoted in the third column correspond to the statistical errors 
obtained in the first fit. If another procedure was chosen a different C value would 



data set 


category 


correction factor C 


correction factor C 






for B d , B s and B baryon 


for B u mesons 


1992-1993 


5 


0.75 ±0.07 


0.54 




6 


0.76 ±0.06 


0.54 




7 


0.72 ±0.07 


0.40 




8 


0.63 ±0.09 


0.20 


1994-2000 


5 


0.93 ±0.05 


0.60 




6 


0.94 ±0.04 


0.60 




7 


0.83 ±0.07 


0.40 




8 


0.63 ±0.09 


0.20 



Table 5: The fitted correction factors C for the inclusive vertex sample. 



have been obtained. The largest change in the C value for non-B u particles is quoted 
as a systematic error and amounts to ± 15%. The systematic error is larger than the 
statistical error. 

It was found, using the simulation, that the acceptance for the uds and charm quarks 
depends on the tagging purity. The acceptance A(t) for B events also varies slightly as 
a function of the tagging purity. This was taken into account in the like- and unlike-sign 
probability distributions. A comparison between data and simulation showed a slightly 
different acceptance function. The acceptance function was corrected to obtain better 
agreement between the data and the parametrisation in the fitting program. Note that 
for B° — B° and B° s — B° s oscillations, only the fraction of like-sign tag events is relevant 
and to first order the acceptance correction drops out. 
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In Figure 8 the distributions for the like- and unlike-sign tagged events, as a function of 
the proper time, corresponding respectively to the soft lepton sample and to the inclusive 
vertex sample, are shown. In these Figures, the events have been weighted by |cb s — 0.5|. 
In this way events with a higher tagging purity acquire a higher weight. Events with a 
purity of 0.5 carry no information and have a weight equal to zero. A good description 
of the data is obtained. 
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Figure 8: Reconstructed proper time distributions for the like- and unlike-sign tagged 
events corresponding to the soft lepton (upper diagram) and inclusive vertex (lower dia- 
gram) samples. The 1994 to 2000 data are shown as dots, the fitted parametrization is 
shown histogram. 



In Figures 9 and 10 the fractions of weighted - as described above - like-sign tagged 
events, as a function of the proper time, for the soft lepton sample and inclusive vertex 
sample, are shown for the 1992 to 2000 data. In these Figures, values of Am^ of 0.495 ps -1 
and Am s of 15 ps _1 are used in the parametrizations corresponding to the continuous 
lines. 
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Figure 9: Fraction of like-sign tagged events as a function of the reconstructed proper 
time for the soft lepton and inclusive vertex samples. The 1992 and 1993 data are shown 
as points with error bars, the parametrization is given as a solid line. 
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Figure 10: Fraction of like-sign tagged events as a function of the reconstructed proper 
time for the soft lepton and inclusive vertex samples. The 1994 to 2000 data are shown 
as points with error bars, the parametrization is given as a solid line. 
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2.7 Measurement of the oscillation frequency 

The mass difference between the two physical states in the B d — B° system was deter- 
mined by fitting the fraction of weighted like-sign tagged events - shown in Figures 9, 10 
- as a function of the reconstructed proper time t. The following expression was used for 
the number of weighted like-sign events: 

N like (t) = A(t) N b /b u ^^(1 - e b ) 

+ A(t)N b f Bh ^J^^(l-e b ) 

baryon 

+ A(t)N bfBs ^ (20) 

+A(t) N b / Bd £^i[(l - 2e d ) cos(Am d t) + 1] 
+ (1 - e c )N c (t) +\l - e uds )N uds (t). 

The total number of weighted events is equal to: 

N tot (t) = A (t) N b hse- t/TBi /T Bi + N c (t) + N uds (t). (21) 

The event-by-event tagging purity is used as a weight. The values for / Bi and for the 
B-hadron lifetimes were fixed at the values listed in section 2.4. N b is the total number 
of b quark events. The functions N c (t) and N uds as well as the acceptance A(t) were 
parametrized using the simulation. The total number of events from charm and uds 
quarks are obtained by intergrating these functions. The tagging purities e c and e uds 
were taken from the simulation. 

The like-sign tagged fraction was fitted in the range from 0.5 to 12 ps using a 
binned x 2 fit- First a fit was performed on the simulated data, i.e. the parametrization 
as shown in Figures 9 and 10. In this fit Am d , the B d mass difference, is fixed and e d , R i: 
R 2 and a are left free, where e d is the B^ tagging purity and the tagging purity for the 
other b hadrons e b is parametrized as: e b = R\ e at + R 2 e - '. The parameter R 2 takes into 
account the slight dependence of the tagging purity as a function of the proper time. 

In a second fit, the data were fitted leaving free Am d , e d , Ri and R 2 . The parameter a 
was fixed to the value of 8.5 10~ 3 ps _1 obtained in the previous fit to the simulation. The 
results for the different parameters are: e d = 0.575 ± 0.009 (0.579), R x = 0.550 ± 0.005 
(0.554) and R 2 = 0.080 ± 0.022 (0.059). Within parentheses are given the results for the 
fit to the simulated data. 

The result for the B d mass difference is Am d = 0.531 ± 0.025 (stat.) with a x 2 ''/ndf 
of 22.5/(23-4), as shown in Figure 11. 

The reason for performing a four parameter fit is that both tagging purities for the B d 
meson and for the other B particles are determined using the data. Therefore systematic 
uncertainties on these parameters were largely reduced. In this way the fit results become 
also less sensitive to, for example, the fraction of B° s mesons. Due to the fact that the fit 
was first applied and tuned to the simulated data, the resolution function is taken into 
account. 

A breakdown of the systematic errors affecting the measurement is given in Table 6. 
The range of values for the fractions and lifetimes of the different B species come from 
ref. [10]. The fractions of B° s mesons and B baryons were changed (correspondingly the 
other B fractions are recalculated) as well as the lifetimes and backgrounds. The tagging 
correction factor C (see section 2.6) was varied by a relative 5% for the soft lepton sample 
and by 15% for the inclusive vertex sample. The proper times were scaled with 1%, coming 
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Figure 11: Fraction of like-sign tagged events as a function of the reconstructed proper 
time using 1992-2000 data. The data were shown as points with error bars, the solid line 
corresponds to the fit. 



from the scale uncertainty on the reconstructed decay length and momentum, and the 
corresponding systematic error on Am^ was -0.0049 ps _1 . The resolution function 1Z (see 
section 2.5) was smeared by an additional Gaussian term with a width of 0.1+0.03t ps 
and the resulting shift in Am^ was 0.0037 ps -1 . This additional smearing corresponds 
to a 10% systematic error on the expected decay length resolution and a 10% systematic 
error on the expected momentum resolution. 

The total systematic error amounts to 0.007 ps -1 . 

The final result is thus: 

Am d = (0.531 ± 0.025 (stat) ± 0.007 (syst.)) ps" 1 . 

The total error is therefore 0.027 ps" 1 . 

This result for the mass difference between the two physical states in the B° — B° 
system is compatible with those from other experiments [10]. 

A fit was also performed to extract the width difference ArB d . In the fit, the expression 
inEq. (20) [(l-2e d ) cos(Am d t)+l] was replaced by [(l-2e d ) cos(Am d t) + cosh(Ar Bd t/2)] 
and the expression in Eq. (21) was modified accordingly. The result of the five parameter 
fit is |Ar B J = (0.00 ± 0.06)ps -1 . The total systematic error was evaluated for the error 
sources listed in Table 6 and found to be 0.0002 ps -1 . Using the measured lifetime 
r Bd = (1.55±0.03)ps [10], |Ar Bd |/r Bd = 0.00 ±0.09 (tot). The following upper limit was 
derived: 

|Ar Bd |/r Bd < 0.18 at 95% CL. 
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error source 


values 


J- x ' / — 1 \ 

systematic error (ps ) 


r 


0.097 to 0.108 [10] 


-0.00021 


r 

iB bar yon 


0.103 to 0.12 [10J 


0.00039 




1.55 to 1.60 ps [10] 


0.0001 




1.65 to 1.67 ps [10] 


-0.0008 


r B d 


1.55 to 1.58 ps [10] 


0.0012 


baryon 


1.2 to 1.25 ps [10] 


-0.0008 


uds background scale factor 


1 to 1.10 


-0.00022 


charm background scale factor 


1 to l.lU 


U.UUUOz 


tagging factor AC / C 


variation 5% (15%) 


0.0006 


scale factor proper time 


1 to 1.01 


-0.0049 


resolution smearing 




0.0037 


Total systematic error 




0.0067 



Table 6: The systematic errors affecting the Arrid measurement. 



and similarly: 



2.8 Search for B° - oscillations 

To search for B° s — B° s oscillations a likelihood fit was performed, where the negative 
log-likelihood is defined as: 

£ — ^ ^ ln{^P (treci Pcombi Pdecay)) ^ ^ ln{T* {treci Pprodi -f deem/)) i (22) 

like— sign unlike— sign 

where the expression for p hke and r p unllke can be found in Eqs. (13) and (14). To extract 
results from this fit the so-called amplitude method was used [11]. For the mixed and 
unmixed B° events the following expressions were used: 

V u nmix . = J_ e -^[i + ^co S (Am a t)] (23) 
2r Bs 

V mix. = J_ e -^[i_Acos(Am a t)]- (24) 
2r Bs 

B s oscillations will correspond to a value A of unity. The oscillation amplitude A and its 
error a a were fitted to the data as a function of Am s . The result of the amplitude fit is 
shown in Figures 12 and 13. 

Before discussing the result and its interpretation, the systematic errors have been 
studied. This was done by changing one parameter at a time (for example /bJ and 
redoing the full amplitude fit. The systematic error was then evaluated as [11]: 

^.stat -.stat 

< VSt = A 1 -A + (1- Ao) M stat Ao , (25) 

a A 1 

where Aq(Ai) and o"A (o"yii) denote the fitted amplitude and error before (after) changing 
the parameter. The last term in Eq.(25) takes into account the change in the error of the 
fitted amplitude. The following parameters have been changed as in Table 6: 

• / Bb from 0.097 to 0.108, 

• the uds and charm backgrounds have been scaled up by 10%, 

• the tagging purity has been changed by varying the correction factor C by 5% for the 
soft lepton tag and by 15% for the inclusive vertex tag, 
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• the constant term a for 71, the width of the resolution function (see section 2.5), has 
been changed by a relative 10%, 

• <jp, the width of the resolution function for the momentum, has been changed by a 
relative 10%. 

The total systematic error as a function of Am s is shown in Figure 12b. It is at most 
35% of the statistical error. 

Using the results for the amplitude and its error it is possible to obtain the 95% CL 
exclusion region or sensitivity. This region corresponds to A + 1.645 a a < 1- This curve 
is shown in Figures 12 and 13. No B° — B° oscillations have been observed in the data. 
A limit on the mass difference of the two physical B° states can be put: 

Am, > 5.0 ps" 1 at 95 % CL. 

Using the error on A, a a, the sensitivity is found to be: 

Sensitivity = 6.6 ps _1 . 

The sensitivity would be 6.8 ps _1 if the systematic error on the amplitude was neglected 6 . 



6 In Table 7 the results arc given for the amplitude and its error as a function of Am s after adjusting /b s to the recently 
published value of 0.106 [13]. 



25 
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Figure 12: a) Fitted values of the oscillation amplitude A as a function of Am s . The 
horizontal line corresponds to the value A=l. The black band is situated between the 
curves for A + 1.645 cr j 4 stat and A + 1.645 oj^ tot . b) The total amplitude error as a function 
of Am s . The upper band is situated between the statistical error (crA st at) an d the total 
error (<JA tot )- The lower curve shows the systematic error . The crossing point with 
the dashed line of the rising curve for the total error with ct^=1/1.645 at Am s = 6.6 ps -1 
gives the sensitivity. 
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Figure 13: Fitted values of the oscillation amplitude A as a function of Am s . The data are 
identical to those of Fig. 12. The dashed horizontal line corresponds to A—l. The black 
band is situated between the curves for A + 1.645a A stat and A + 1.645 cr^ tot . The smoothly 
rising curve corresponds to 1.645 (JA tot - The crossing point with A—l at Am s = 6.6 ps _1 
gives the expected lower limit at 95% CL. 
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Am s (ps 1 ) 


A 


a a (stat) 


a a (total) 


0.25 


-0.17 


0.11 


0.13 


0.75 


0.12 


0.15 


0.17 


1.25 


0.46 


0.18 


0.20 


1.75 


0.35 


0.21 


0.23 


2.25 


0.13 


0.23 


0.26 


2.75 


0.16 


0.25 


0.28 


3.25 


0.19 


0.28 


0.31 


3.75 


0.26 


0.31 


0.33 


4.25 


0.23 


0.34 


0.37 


4.75 


0.13 


0.37 


0.40 


5.25 


0.39 


0.41 


0.43 


5.75 


0.65 


0.45 


0.47 


6.25 


0.30 


0.50 


0.53 


6.75 


-0.26 


0.56 


0.58 


7.25 


-0.53 


0.62 


0.65 


7.75 


-0.61 


0.69 


0.71 


8.25 


-0.71 


0.76 


0.78 


8.75 


-0.90 


0.85 


0.87 


9.25 


-1.02 


0.95 


0.96 


9.75 


-0.72 


1.05 


1.06 


10.25 


0.02 


1.16 


1.17 


10.75 


0.53 


1.28 


1.30 


11.25 


0.54 


1.43 


1.45 


11.75 


0.36 


1.59 


1.62 


12.25 


0.39 


1.77 


1.82 


12.75 


0.81 


1.96 


2.02 


13.25 


1.42 


2.17 


2.23 


13.75 


1.98 


2.40 


2.44 


14.25 


1.90 


2.65 


2.69 


14.75 


0.65 


2.91 


2.96 


15.25 


-1.11 


3.17 


3.22 


15.75 


-2.04 


3.44 


3.48 


16.25 


-1.36 


3.74 


3.84 


16.75 


0.88 


4.08 


4.30 


17.25 


3.90 


4.45 


4.77 


17.75 


6.61 


4.85 


5.14 


18.25 


8.43 


5.27 


5.44 


18.75 


9.36 


5.69 


5.79 


19.25 


9.66 


6.12 


6.23 


19.75 


8.99 


6.53 


6.68 


20.25 


8.22 


6.92 


7.09 


20.75 


8.00 


7.33 


7.47 


21.25 


8.02 


7.79 


7.93 


21.75 


8.63 


8.31 


8.52 


22.25 


10.32 


8.89 


9.19 


22.75 


12.47 


9.48 


9.84 


23.25 


14.58 


10.05 


10.43 


23.75 


16.20 


10.62 


11.03 


24.25 


17.11 


11.21 


11.68 


24.75 


17.77 


11.82 


12.38 



Table 7: The amplitude and its statistical and systematic error as a function of Am. 
after adjusting / Bs to the published value of 0.106 [13]. 
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3 A neural network analysis 

The inclusive B° s analysis described in this section was an attempt to optimize the 
statistical precision attainable in the high Am s region. This analysis made extensive 
use of neural network techniques for tagging and vertex reconstruction, mostly based on 
the BSAURUS [12] package. Several neural networks were used on the event and track 
level. For optimal performance a good resolution on the proper time was required and 
this was achieved by keeping the energy and the vertex reconstruction separated in the 
analysis. The separated treatment of decay length and energy reconstruction led to a 
CPU intensive two-dimensional integration for each event. Only the best class of events 
(in terms of the decay length resolution) was used, to reach an optimal performance for 
high Am s values. The restrictive cuts on quality and decay length resolution led to a 
sample of only 30 k events for the data taken in 1994. 

3.1 Event selection 

Multihadronic Z° events were selected requiring at least 5 reconstructed tracks and 
a total reconstructed energy larger than 12% of the centre-of-mass energy. The event 
was rejected if it had more than 3 jets or if the value of | cos(9 t hrust)\ was larger than 
0.75. The cosine of the opening angle between the two most energetic jets was required 
to be less than —0.98. Further, the value of the combined event b-tagging variable x ev 
as defined in ref. [5] had to be larger than 0.5. Events having an identified lepton with 
a transverse momentum larger than 1.2 GeV/c were removed. To obtain a homogeneous 
data set, it was required that both the liquid and gas radiators of the Barrel RICH were 
fully operational. 

The same selection was applied to simulated Z — > qq events using the JETSET 7.3 [6] 
generator. 

Each event was split into hemispheres using the plane perpendicular to the thrust axis. 
A first estimate of the B candidate momentum vector was obtained by calculating the 
charged particles rapidities, with respect to the thrust axis, and by summing the momenta 
of those with rapidity > 1.6. In each hemisphere a secondary vertex was fitted using the 
tracks with vertex detector hits from high rapidity charged particles. The secondary 
vertex fit was performed in three dimensions using as a constraint the direction of the B 
candidate momentum vector. The result of the vertex fit was used as an input to a Neural 
Network, the so-called TrackNet, that distinguishes between a fragmentation track and 
a track originating from a weakly decaying B hadron. In the final stage of the fit, the 
TrackNet output was used to add candidate tracks to the secondary vertex and the fit 
was redone. 

Finally, a hemisphere was rejected if the secondary vertex fit did not converge. 

3.2 Flavour tagging 

The tagging of the quark flavour at production and decay times is necessary to dis- 
tinguish mixed from unmixed B° mesons. Only the opposite hemisphere was used for 
the production tag to reduce correlations between the production and decay tags. The 
decay tag was based on track-by-track flavour nets, which were later combined using a 
likelihood ratio to tag the presence of a B or B meson at decay time in each hemisphere. 
For the production tag a dedicated neural network was used. 
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3.3 The track-by-track flavour nets 

Eight different networks were trained corresponding to a production and a decay 
ffavour network for each of the four B hadron types. The aim of each network was 
to exploit, track- by-track, the correlation between the charge of a single track and the 
b quark charge. This approach is motivated by the different decay chains of the various 
types of B hadrons where, for example, the 'charge' of the D meson determines the b 
quark charge. 

The discriminating input variables are: particle identification (e.g. kaon, proton and 
lepton probabilities), B-D vertex separation based on a network trying to discriminate 
between tracks originating from the weakly decaying B hadron and those from the subse- 
quent cascade D meson decay, the momenta in the B rest frame and variables related to 
tracking quality. The track decay flavour nets use 21 input variables in total, while the 
track production flavour nets have 18 input variables; essentially the same input variables 
without the lepton identification and the B-D net variables. 

To obtain a flavour tag in a given hemisphere the individual track probabilities 
P{track)\ (i = B u , B d , B s , B baryon and j = production or decay) coming from the dif- 
ferent networks were combined in the following way, 



where q(track) is the charge. For the production flavour tag, tracks with TrackNet values 
less than 0.5 are selected, while for the decay flavour tag, tracks must have a TrackNet 
value above 0.5. 

3.4 The B° s product ion and decay flavour tag 

The production flavour net was constructed using all the information available in the 
hemisphere, i.e. the fragmentation and decay flavour probabilities P(hem)^°^^ Bh ^^, 
and the quality of the information for the selected hemisphere. More details on the flavour 
networks and on the flavour tag can be found in [12]. 

In Figure 14a the probability distribution for the production tag for 1994 data and 
simulation is shown. The grey lines indicate the distributions for b and b quarks. The 
achieved tagging purity on simulation is 71% at 100% efficiency 

For the B° decay flavour tag, the probability P(hem) d ^o ay was used. In Figure 14b the 

B° decay flavour probability distributions for 1994 data and simulation are shown. The 
tagging purity on simulation was 62% at 100% efficiency The contributions from light 
and charm quarks are very small due to the high b purity of the sample of 98.3 %. 

3.5 Energy reconstruction 

To determine the proper time, a precise estimate of the energy of the decaying B 
hadron is needed. The starting point was a raw estimate of the B energy E raw and mass 
m Taw . These quantities were determined by weighting (with a sigmoid threshold function) 
the momentum and energy components of the charged particles by the TrackNet output 
value and the neutral particles by their rapidity. For three-jet events only the rapidity 
was used as a weight. In this way particles coming from the decaying B hadron receive 
a higher weight. 




(26) 
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Tagging probability Tagging probability 

Figure 14: The production (a) and decay (b) tag distributions for 1994 data and simula- 
tion. The distributions for b and b are indicated in light and dark grey. The decay tag 
is optimized for B s and B s particles. 



The raw energy was corrected as a function of m raw and of the fraction of the energy 
in the hemisphere, Xh, to obtain an improved estimate of the energy. This was done in 
the following way. The simulated data were divided into several samples according to 
the measured ratio Xh and for each of these samples the AE, defined as the true energy 
minus the raw energy, was plotted as function of m raw . The median values of AE in 
each bin of m raw were calculated and the m raw dependence was fitted by a third order 
polynomial: 

AE(m raw , x h ) = a + b(m raw - (m raw )) + c(m raw - (m raw )) 2 + d(m raw - (m raw )) 3 . (27) 

The four parameters a, b, c, d in each Xh bin were then studied as functions of Xh and 
parametrized with third and second-order polynomials. In this way a smooth correction 
function was obtained. 

This procedure led to an estimate of the B hadron energy. Studies on simulated 
B events showed a large correlation between the number of tracks and the B energy 
resolution. For this reason, the number of tracks in the hemisphere was chosen to define 
different resolution classes. In total 16 different classes were defined, starting with 2 
tracks per hemisphere in class one and ending with 17 and more tracks in class 16. The 
central Gaussian of a double Gaussian fit to the B energy resolution varies from 4% in 
the best class to 15% in the worst. 

3.6 Decay length reconstruction 

Starting from the secondary vertex algorithm, described in section 3.1, an optimized 
algorithm was developed with the aim of improving the decay length resolution and of 
minimizing the forward bias resulting from the inclusion of tracks from the cascade D 
decay vertex in the B decay vertex reconstruction. Based on the output of the B-D net, 
a so-called 'Stripping' algorithm was developed. 

For the 'Stripping' algorithm candidate tracks were selected if they had a TrackNet 
output larger than 0.5 and a B-D net output value less than 0.45. The B-D net cut 
value corresponds to an efficiency of 50% for selecting a track from a weakly decaying B 
hadron at a purity of 75%. A secondary vertex fit was performed if two or more tracks 
were selected. If the fit failed to converge within the algorithm criteria and more than 
two tracks were selected, the track with highest x 2 contribution was removed and the fit 
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particle 


decay tag old 


particle 


production tag ap 




1 


b quarks 


0.94 


B d 


1.08 


c quarks 


0.56 


B„ 


1.15 


uds quarks 


0.84 


B baryon 


0.93 






c quarks 


1.05 






uds quarks 


0.08 







Table 8: The a parameters for the decay and production tag for the different particles 
as obtained from the 1994 simulation 



was repeated. This procedure was done iteratively until convergence was reached or two 
tracks were left. Finally, the direction of the B, as estimated by the B energy algorithm, 
was used as a constraint. The overall efficiency to find a vertex was about 50%. 

Events with a very good decay length resolution were selected by requiring that the 
expected error on the decay length was smaller than 200 jim. 

Because of cuts on the TrackNet output, on the B-D output and on the expected 
decay length error, less events will be reconstructed at small decay length. Therefore 
an acceptance function depending on the true B decay length was calculated using the 
simulation. 

After having applied these cuts, 30k hemispheres were selected in the 1994 data sample. 
The b purity of the sample was estimated from simulated events to be 98.3%. 

3.7 The likelihood fit 

In the fitting program, the like- and unlike-sign events were separated in the same way 
as described in section 2.5 of the previous analysis and the same expressions for like- and 
unlike-sign probabilities were used. 

A difference from the previous analysis was the treatment of the resolution functions 
Tt(lrec — krue, krue) and 1Z({p rec — Ptrue) / Ptme) , which were kept separated. As a param- 
eterization for the decay length / two asymmetric Gaussian distributions were chosen, 
while for the momentum reconstruction two symmetric Gaussian distributions were used. 
The probability for a B event to be observed at a proper time P(t rec ) is a convolution over 
an exponential B decay distribution, an acceptance function A(l,p), the true B hadron 
momentum distribution F{p) and the resolution functions IZi and 1Z P , all four taken from 
simulation: 

poo poo ^ — lm/(rp) 

v b (t rec )= / A{i,p)F(p)n l {i rec -i,i)n p {{ Prec -p)/p) didp, (28) 

J 1=0 Jp=0 r 

were r denotes the B lifetime and Eq. (8) was used to calculate the proper time. 

3.8 Modelling simulation and data 

As explained in section 2.6 it is important to model precisely the tagging purities. In 
this analysis the raw purities were modified using a parameter a as defined in Eq. (17). 
The decay and production tag parameters for the different particles were obtained from 
simulation, and are listed in Table 8. 
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Figure 15: In the left plot the fitted B s oscillation amplitude for the NN analysis is shown 
as a function of Am s as points and error bars. The continuous (dotted) lines correspond 
to A ± 1.645<Tyi 3tat(tot) . The dashed line corresponds to 1.645o"A tot - The plot on the right 
side shows the fraction of weighted like-sign tagged events as a function of the proper 
time. The data are shown as points with error bars, the parametrization is given as a 
solid line. 



For the real data, the correction factor C, defined in Eq. (19), was determined from 
the fraction of like-sign events, using the same method as was discussed in section 2.6. 
Two correction factors were needed, one for B u mesons and one for the other B mesons. 
Their values were Cb u = 0.53 and C = 0.81. 

Using the amplitude method [11] the result shown in Figure 15a was obtained. A limit 
on Am s was not extracted as the analysis was optimized for high values of Am s . Figure 
15b shows the agreement between the data and the description by the fitting programme. 

Systematic uncertainties have been evaluated by varying a single parameter at a time 
(e.g /b s ) and redoing the full amplitude fit. The systematic error was then calculated as 
defined in Eq. (25). The same parameters as described in section 2.8 were varied and 
the systematic error was determined to be at most 25% of the statistical error. 

The error on the fitted B° s amplitude at Am s of 15 and 20 ps -1 gives respectively 5.1 
and 11.8 for this analysis using only 1994 data. This can be compared with the values of 
5.0 and 10.9 obtained with the previous analysis using only the 1994 data sample. The 
results of the neural network analysis optimized for high values of Am s are compatible 
with the results for the 1992-2000 data shown in section 2.8. No attempt is made to 
combine the results. 
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4 Conclusion 

Using a total sample of 770 k events - of which 155 k events contain a soft lepton - 
the mass difference between the two physical states in the B° d — B° d system was measured 
to be: 

Am d = (0.531 ± 0.025(stat.) ± 0. 007 (syst.))^' 1 . 

The following limit on the width difference between the two states was obtained: 

|Ar Bd |/r Bd < 0.18 at 95% CL. 

As no evidence for B° — B° oscillations was found, a limit on the mass difference of 
the two physical states was given: 

Am s > 5.0 ps- 1 at 95 % CL 

with a sensitivity equal to 6.6 ps _1 . 

These results are compatible with a neural network analysis optimized for high values 
of Am s . 
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